Selecting Appropriate Representations for Learning from Examples
نویسندگان
چکیده
The task of inductive learning from examples places constraints on the representation of training instances and concepts. These constraints are different from, and often incompatible with, the constraints placed on the representation by the performance task. This incompatibility explains why previous researchers have found it so difficult to construct good representations for inductive learning-they were trying to achieve a compromise between these two sets of constraints. To address this problem, we have developed a learning system that employs two different representations: one for learning and one for performance. The learning system accepts training instances in the “performance representation,” converts them into a “learning representation” where they are inductively generalized, and then maps the learned concept back into the “performance representation.” The advantages of this approach are (a) many fewer training instances are required to learn the concept, (b) the biases of the learning program are very simple, and (c) the learning system requires virtually no ‘vocabulary engineering” to learn concepts in a new domain.
منابع مشابه
Learning Node Selecting Tree Transducer from Completely Annotated Examples
A base problem in Web information extraction is to find appropriate queries for informative nodes in trees. We propose to learn queries for nodes in trees automatically from examples. We introduce node selecting tree transducer (NSTT) and show how to induce deterministic NSTTs in polynomial time from completely annotated examples. We have implemented learning algorithms for NSTTs, started apply...
متن کاملFostering engaged and directed learning by activity foregrounding and backgrounding
We propose a design model for guiding learning in exploratory environments through representational choices. Selecting the appropriate representations at the correct granularity can foreground the salient activities and background the irrelevant.. We demonstrate how this model can be applied via learner-centred design to ensure that the engaging factors of an environment are preserved whilst th...
متن کاملReferent : Johan Bos Titel :
Computational semantics is the business of associating meaning representations with natural language expressions and drawing inferences from them. It is an area that has matured to a state in which we have seen the arrival of broad coverage systems capable of producing deep semantic representations for open-domain texts. Such systems however face the problem of selecting appropriate background ...
متن کاملSelection of Relevant Features and Examples in Machine Learning Selecting Relevant Features and Examples
In this survey, we review work in machine learning on methods for handling data sets containing large amounts of irrelevant information. We focus on two key issues: the problem of selecting relevant features, and the problem of selecting relevant examples. We describe the advances that have been made on these topics in both empirical and theoretical work in machine learning, and we present a ge...
متن کاملFrom Active to Proactive Learning Methods
In many machine learning tasks, unlabled data abounds, but expert-generated labels are scarce. Consider the process of learning to build a classier for the Sloan Digital Sky Survey (http://www.sdss.org/) so that each astronomical observation may be assigned its class (e.g. “pinwheel galaxy”, “globular galaxy”, “quasar”, “colliding galaxies”, “nebula”, etc.). The SDSS contains 230 million astron...
متن کامل